SITE LINK

KMID : 1022420140060010077

Phonetics and Speech Sciences
2014 Volume.6 No. 1 p.77 ~ p.83

Automatic Clustering of Speech Data Using Modified MAP Adaptation Technique

Ban Sung-Min

Kang Byung-Ok
Kim Hyung-Soon

Abstract

This paper proposes a speaker and environment clustering method in order to overcome the degradation of the speechrecognition performance caused by various noise and speaker characteristics. In this paper, instead of using the distancebetween Gaussian mixture model (GMM) weight vectors as in the Google’s approach, the distance between the adapted meanvectors based on the modified maximum a posteriori (MAP) adaptation is used as a distance measure for vector quantization(VQ) clustering. According to our experiments on the simulation data generated by adding noise to clean speech, theproposed clustering method yields error rate reduction of 10.6% compared with baseline speaker-independent (SI) model,which is slightly better performance than the Google's approach.

KEYWORD

speech recognition, speech data clustering, KL divergence, MAP adaptation

FullTexts / Linksout information

Listed journal information

site infomation

Prohibition of Unauthorized Collection of E-mail Addresses, medric.kyung@gmail.com
N4 301, Chungbuk National University, Chungdae-ro 1, Seowon-Gu, Cheongju, Chungbuk 28644, Korea